Inference with Transposable Data: Modeling the Effects of Row and Column Correlations

نویسندگان

  • Genevera I. Allen
  • Robert Tibshirani
چکیده

We consider the problem of large-scale inference on the row or column variables of data in the form of a matrix. Many of these data matrices are transposable meaning that neither the row variables nor the column variables can be considered independent instances. An example of this scenario is detecting significant genes in microarrays when the samples may be dependent due to latent variables or unknown batch effects. By modeling this matrix data using the matrix-variate normal distribution, we study and quantify the effects of row and column correlations on procedures for large-scale inference. We then propose a simple solution to the myriad of problems presented by unanticipated correlations: We simultaneously estimate row and column covariances and use these to sphere or de-correlate the noise in the underlying data before conducting inference. This procedure yields data with approximately independent rows and columns so that test statistics more closely follow null distributions and multiple testing procedures correctly control the desired error rates. Results on simulated models and real microarray data demonstrate major advantages of this approach: (1) increased statistical power, (2) less bias in estimating the false discovery rate, and (3) reduced variance of the false discovery rate estimators.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Supplement to “Inference with Transposable Data: Modeling the Effects of Row and Column Correlations”

A common error measure for controlling the number of false positives in microarrays is the False Discovery Rate (FDR). This is the expectation of the False Discovery Proportion (FDP): let V be the number of false positives and R be the total number of rejections, then q = FDR = E(V/R|R > 0). Typically, investigators seek to control the FDR at q = 0.1, meaning that on average 10% of rejections a...

متن کامل

Stagewise Modeling of Liquid-Liquid Extraction Column (RDC)

Stagewise forward mixing model considering coalescence and redispersion of drops was used to predict the performance of Rotating Disc Liquid-Liquid Extraction Contactors. Experimental data previously obtained in two RDC columns of 7.62cm diameter, 73.6cm height and 21.9cm diameter, 150cm height were used to evaluate the model predictions. Drop-side mass transfer coefficients were predicted appl...

متن کامل

Row/Column-First: A Path-based Multicast Algorithm for 2D Mesh-based Network on Chips

In this paper, we propose a new path-based multicast algorithm that is called Row/Column-First algorithm. The proposed algorithm constructs a set of multicast paths to deliver a multicast message to all multicast destination nodes. The set of multicast paths are all of row-first or column-first subcategories to maximize the multicast performance. The selection of row-first or column-first appro...

متن کامل

Adaptive Neuro-fuzzy Inference System Prediction of Zn Metal Ions Adsorption by γ-Fe2o3/Polyrhodanine Nanocomposite in a Fixed Bed Column

This study investigates the potential of an intelligence model namely, Adaptive Neuro-Fuzzy Inference System (ANFIS) in prediction of the Zn metal ions adsorption in comparision with two well known empirical models included Thomas and Yoon methods. For this purpose, an organic-inorganic core/shell structure, γ-Fe2O3/polyrhodanine nanocomposite with γ-Fe2O3 nanoparticle as core with average diam...

متن کامل

Adaptive beamforming in row-column addressed arrays for 3D ultrasound imaging

In recent years, to reduce the complexity of implementation, the use of 2D arrays with restricted row-column addressing has been considered for 3D ultrasound imaging. In this paper, two methods of adaptive beamforming based on the minimum variance method are represented in such a way that the computational load is much less than using the full adaptive beamforming method. In both proposed metho...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011